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ABSTRACT 

The halo of M31 shows a wealth of substructures, some of which are consistent with the assembly 
from satellite accretion. Here we report on kinematic and abundance results from Keck/DEIMOS 
spectroscopy in the near-infrared calcium triplet region of over 3500 red giant star candidates along 
the minor axis and in off-axis spheroid fields of M31. These data reach out to large radial distances 
of about 160 kpc. The derived radial velocity distributions show an indication of a kincmatically cold 
substructure around ~ 17 kpc, which has been reported before. We devise a new improved method 
to measure spectroscopic metallicities from the calcium triplet in low signal-to-noise spectra using a 
weighted coaddition of the individual lines. The resulting distribution (accurate to ~0.3 dex down to 
signal to noise ratios of 5) leads us to note an even stronger gradient in the abundance distribution 
along M31's minor axis and in particular towards the outer halo fields than previously detected. The 
mean metallicity in the outer fields reaches below —2 dex, with individual values as low as < —2.6 
dex. This is the first time such a metal poor halo has been detected in M31. In the fields towards 
the inner spheroid we find a sharp decline of ~0.5 dex in metallicity in a region at ~20 kpc, which 
roughly coincides with the edge of an extended disk, previously detected from star count maps. A large 
fraction of red giants that we detect in the most distant fields are likely members of M33's overlapping 
halo. A comparison of our velocities with those predicted by new N-body simulations argues that 
the event responsible for the giant Stream is most likely not responsible for the full population of the 
inner halo. We show further that the abundance distribution of the Stream is different from that of 
the inner halo, from which it becomes evident, in turn, that the merger event that formed the Stream 
and the outer halo cannot have contributed any significant material to the inner spheroid. All these 
severe structure changes in the halo suggest a high degree of infall and stochastic abundance accretion 
governing the build-up of M31's inner and outer halo. 

Subject headings: Galaxies: abundances — Galaxies: evolution — Galaxies: kinematics — Galaxies: 
structure — Galaxies: stellar content — Galaxies: structure — Galaxies: individual 
(M31) 
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1. INTRODUCTION 

With the discovery of the Giant Stream (Ibata et al. 
2001), the mapping of complex structures in the "halo" 
of M31 (Ferguson et al. 2002; Gilbert et al. 2007; Ibata 
et al. 2007), and the isolation of an extended kinematic 
disk structure (Ibata et al. 2005) the idea that the ra- 
dially more distant populations of M31 originate in ac- 
cretion events has become established. Even a subset 
of M31 satellites might relate to the breakup of a mas- 
sive progenitor, based on the polar and planar alignment 
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of a number of its early-type satellite galaxies (Koch & 
Grebel 2006). 

Prior to this paradigm shift in the description of the 
M31 halo, relatively shallow Hubble Space Telescope 
(HST) imagery revealed what appeared to be a mostly 
metal rich M31 halo (e.g. Rich et al. 1996; Bellazini 
et al. 2003). Mould & Kristian (1986) were the first 
to find a metal rich (47 Tuc-like) halo population, using 
ground-based imaging. The widespread presence of this 
metal rich population as well as the descending red gi- 
ant branch (RGB; in contrast to the metal poor globular 
clusters) was noted by Bellazzini et al. (2003). Ground- 
based minor axis star counts appear to show a smooth 
r 1 / 4 spheroid (Pritchet & van den Bergh 1994), and some 
studies have argued that the metal rich halo population is 
an extension of the metal rich bulge (e.g. Mould & Kris- 
tian 1986; Guhathakurta et al. 2006). But alternatively, 
the case has been made that the metal rich stars more 
closely coincide with the perturbed regions (Ferguson et 
al. 2002). 

Metal poor stars around M31 were already indicated 
in photometric studies of its halo at projected distances 
of 7 kpc (Mould & Kristian 1986) to 20 kpc (Durrell 
2001). Subsequent spectroscopic surveys then fully re- 
vealed a complex picture of the halo composition: Using 
the well-established near-infrared calcium triplet (CaT) 
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as a metallicity indicator, Reitzel & Guhathakurta (2002) 
find metal poor stars at 19 kpc on the minor axis. Bene- 
fitting from much larger samples, Chapman et al. (2006) 
and Kalirai et al. (2006a) argue based on kinematics and 
metallicity that the expected metal poor halo population 
is in fact present. Giants with the radial velocity of M31 
are claimed as members of the halo to distances in ex- 
cess of 100 kpc (Ostheimer 2003; Gilbert et al. 2006) 
and an overall metallicity gradient of the M31 halo is 
proposed by Kalirai et al. (2006a). Yet, over much of 
this region, Ibata et al. (2007) find clear evidence of den- 
sity enhancements associated with accretion. What frac- 
tion of the outer halo is then comprised of such recently 
accreted material? In particular the chemical composi- 
tion of the outer stars remains unknown, and depending 
on the adopted [a/Fe] ratio, Kalirai et al. (2006a) es- 
timate a mean metallicity in the outermost field ^0.3 
dex higher than the mean Milky Way metallicity (for 
Solar scaled abundances), or a mean that is comparable 
to the Milky Way halo under the assumption of strong 
a — enhancement . 

In fact, pencil beam ultradeep imaging using the Ad- 
vanced Camera for Surveys (ACS) on board the HST, of- 
fers a complementary picture of the complexity present in 
the halo populations. The placement of these deep imag- 
ing fields has benefitted from the starcount maps and 
Keck/DEIMOS kinematic studies. Brown et al. (2003) 
first demonstrated that a minor axis field projected at 
11 kpc contains an indisputable range in age and abun- 
dance, extending to nearly-Solar, and a predominant age 
range from 6-10 Gyr. Comparing 3 fields, in the inner 
spheroid (at 11 kpc), the M31 disk and on the Ibata et 
al. (2001) debris stream, Brown et al. (2006) find the 
disk to be younger and more metal rich, and lacking old 
stars. They find the stream and spheroid fields to be in- 
distinguishable based on their color-magnitude diagrams 
(CMDs), leading to the conclusion that only one progen- 
itor is responsible for the debris field in the inner halo 
region. However, if dynamical mixing were efficient in 
these regions, it could also erase the signatures from dif- 
ferent sources. Brown et al. (2007) investigate a field at 
21 kpc and find evidence that its population is marginally 
older and more metal poor than the inner halo field. 

The present-day pencil beam surveys have found clear 
evidence of an age range in every field studied and appear 
to support other evidence of a gradient in age and abun- 
dance. What has been lacking to date has been a survey 
of abundances and kinematics that ties these fields to- 
gether and provides a context for the interpretation of 
these deep fields. This is one aim of this paper, and a 
natural extension of our systematic survey of the struc- 
ture and kinematics of M31 along its southwest minor 
axis. 

While it is attractive to seek one massive progenitor 
for the inner debris field there are several arguments 
against this position. First, the existence of both the 
giant stream and the extended disk suggest at least two 
very different sources for the debris field at 11 kpc. Sec- 
ond, the dramatic variation of the extended spatial struc- 
ture of the debris field as a function of metallicity and 
age (Ferguson et al. 2002) is best understood by invok- 
ing multiple events involving different accretors. More- 
over, the distant rotating disk-like population (Ibata et 
al. 2005) is superposed on other, likely unrelated, struc- 



tures that are suggestive of shells associated with the 
giant Stream merger (see also Ibata et al. 2007). It 
is then an intriguing question, whether a corresponding 
measurable abundance change occurs at the point where 
this field ends. 

In the context of CDM models (e.g. Bullock & John- 
ston 2005) halos are thought, in general, to accrete from 
the debris of lower mass satellites. Yet Mouhcine et al. 
(2005) find a correlation between parent galaxy lumi- 
nosity and halo metallicity. Nominally, M31 has a high 
metallicity halo and falls in this relationship alongside 
galaxies with more prominent bulges. There is a para- 
dox: how can the halo of M31 be dominated by stochas- 
tic accretion events, yet still have an r 1 / 4 profile and still 
appear to follow trends set by luminosities of the host 
galaxies? 

We report here the culmination of an observational 
campaign begun in 2002; Table 1 lists the observing run 
details by Principal Investigator. This Paper is organized 
as follows: In §2 we present our observations and the 
standard reduction steps taken, while §3 describes our 
radial velocity measurements. The dwarf/giant separa- 
tion is discussed in §4 and in §5 we devise a new technique 
to measure spectroscopic metallicities from the CaT. The 
following sections are then dedicated to the analysis of 
kinematic (§6) and abundance (§7) substructures and 
gradients in M31's halo. Finally, §8 summarizes our find- 
ings. 

2. OBSERVATIONS AND REDUCTION 

In the course of an ongoing large Keck program (PI: 
R. M. Rich) that aims at elucidating the formation his- 
tory of M31's halo structures based on the kinematics 
and chemical analyses of red giants, we collected a vast 
spectroscopic data set, which covers, amongst others, 
fields on the minor axis of M31 reaching from 9 kpc out to 
large projected distances of ^160 kpc towards the south- 
east 9 . These fields were originally imaged by Ostheimer 
(2003). Two additional fields at 60 kpc on the minor axis 
were obtained in the course of a large DEIMOS survey 
covering a wide set of fields spread across M31's full halo 
and disk components (PI: S. Chapman; see Chapman et 
al. 2006). In this Paper, we will focus on the analysis 
of the minor axis data and those off-axis fields in the 
southeast halo quadrant, while the fields located on top 
of the Giant Stellar Stream (Ibata et al. 2001), and those 
coinciding with the HST fields of Brown et al. (2003; 
2006; 2007) will be the subject of a series of forthcoming 
papers. For details on the overall target selection, obser- 
vation strategy and data collection for the whole project 
we refer the reader to Kalirai et al. (2006a, b) and Gilbert 
et al. (2006, 2007). 

2.1. Observations 

Observations were carried out using the DEIMOS mul- 
tislit spectrograph at the Keck II 10 m telescope over a 
number of observing runs from 2002 through 2006 (Ta- 
ble 1), using a slit width of 1". We used the 1200 line 
mm -1 grating, which gives a dispersion of 0.33 A pixel -1 
and a spectral resolution of 1.41 A, as estimated from 
the width of the sky lines. The majority of the spectra 

9 Throughout this work we will adopt a distance to M31 of 784 
kpc (Stanek & Garnavich 1998) 
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were centered at a wavelength of 7800A, yielding a full 
spectral coverage of -6500-9200A, which comprises the 
dominant near infrared lines of the CaT around 8500A. 
Typical integration times were 1 hour per mask, while 
setup f 109_1 (at 9 kpc) was exposed for 3 hours in total. 
Fig. 1 shows the location of the slit masks discussed in 
this paper on an INT based star count map (M. Irwin, 
private communication; Ibata et al. 2007). Details on 
these masks are given in Table 1. 

2.2. Data reduction 

Reduction of the spectra was performed with the 
spec2d pipeline, which has been designed at the Univer- 
sity of California, Berkeley for the DEEP2 survey 10 . The 
standard reduction steps comprise flat fielding, wave- 
length calibration via arc lamp spectra and sky spectrum 
removal. The total number of extracted science spectra 11 
finally amounts to 3631 (see Table 2), where the signal to 
noise (S/N) ratios typically range from 2 to 60 per pixel 
(although a handful of the brightest foreground dwarf 
spectra reach as high as ^120) with a median of 8.5. 
Fig. 2 displays a number of sample spectra of both high 
and low S/N around the CaT and the sodium doublet 
region, which we will utilize to separate M31 giants from 
foreground dwarfs in Section 4. 

2.3. Photometry 

The photometry of our targets, which will be required 
later on to perform a color-based foreground separation 
and to calibrate our spectroscopic metallicity measure- 
ments, was taken from two sources: For the fields tar- 
geted in the outer regions of M31's halo (R>25 kpc) we 
used the Washington M, DD051 and photometry of 
Ostheimer (2003), which provides a strong separation cri- 
terion for red giant selection (Palma et al. 2003). These 
filters were transformed from the Washington system into 
standard Johnson-Cousins V and I magnitudes by apply- 
ing eqs. 1,2 of Majewski (2000). 

The photometry of targets in our inner fields (R<25 
kpc) on M31's minor axis was, on the other hand, taken 
from the MegaCam/Megapipe archive (Gwyn 2008). 
These data are available in i' and either g' or r' . Typi- 
cal exposure times range from 800 to 3757 s for i', 1600 
to 3200 s for r' and 1445 to 3468 s for g'. Photometric 
errors in the catalog are below 0.1 mag for g' < 24 mag, 
and rise to 0.25 mag at g = 26 mag. The errors in r' are 
well below 0.1 mag for r' < 24 mag and reach 0.2 mag at 
r' = 25 mag. Finally, errors on the i'-band magnitudes 
are below 0.1 mag for almost the entire sample below 
22.5 mag, with a maximum error of 0.15 mag at i' = 24 
mag. The photometric data was then matched to our 
spectroscopic catalog by requiring the coordinates from 
the two sets to match within better than 1". In most 
cases, the match was better than 0.2". 

In order to determine the spectroscopic metallicity of 
each star, V-band magnitudes in the Johnson-Cousins 
system are required. As only the g' or r' and i' filters 
in the photometric system of the CFHT are available, 
a transformation to V magnitudes is determined from 



the latest Padova stellar isochrones (Marigo et al. 2008), 
which are available in the CHFT photometric system as 
well as for Johnson V. In practice, we obtained transfor- 
mations from isochrones with metallicities ranging from 
—2.3 to +0.18 dex and ages of 10 and 12 Gyr. The trans- 
formation from g',i' to V was obtained in four sections: 
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On the other hand, the transformation from r,' i 1 to V is 
defined as follows: 



V = r' + 0.98 {r' + 0.048 

V = r' + 0.19 [r 1 + 0.445 

V = r' + 1.03 (r' - i') - 1.500 



r' - i'< 1.8 
1.8 <r' -i'< 4.0 
4.0 <r' - i' 



(2) 

Typically, these equations are insensitive to the 
adopted metallicity and age of the isochrones and differ- 
ences in V-magnitude from the different isochrones are 
less than 0.05 mag. Only for the super-solar isochrone, 
the overall systematic errors increase to 0.1 mag. To ac- 
count for this we add a 0.1 mag uncertainty in quadrature 
to the final error estimates on the V-band magnitude. 

The resultant CMDs for all targeted objects are shown 
in Fig. 3, separately for M31 giant candidates, foreground 
dwarfs and contaminating background galaxies that were 
separated using the methods outlined in Sect. 4. 

3. VELOCITY MEASUREMENTS 

Radial velocities were measured by cross correlating 
our DEIMOS spectra against a high S/N template spec- 
trum of the bright Kl red giant HD139195, which was 
observed using the same instrumental setup as our ob- 
servations (M. Geha, private communication; Simon & 
Geha 2007). In this way we avoid any systematic uncer- 
tainties occurring from potentially different spectral res- 
olutions and dispersions. The correlation was performed 
using iraf's 12 fxcor task. Preferentially, the entire cov- 
ered spectral region was used in the correlation, where we 
rejected the wavelength regions of telluric absorption, in 
particular the prominent atmospheric A- and B-bands at 
AA 7600, 6860A. For cases in which the correlation of the 
entire spectrum produced weak or no correlation peaks, 
i.e., for the lowest S/N spectra, we restricted the win- 
dows to single narrow regions around prominent absorp- 
tion features, such as the CaT from 8475-8662 A and/or 
the Na doublet from 8179-8200A. Choice of these band 
passes will minimize the contribution of potential resid- 
ual telluric absorption lines (e.g., Schiavon et al. 1997). 

Each correlation peak was examined by eye to avoid 
spurious detections, which might lead to significantly 
erroneous velocity estimates. A Gaussian fit to the 
strongest correlation peak then yielded the final relative 
radial velocity value for the respective spectrum. Finally, 
heliocentric corrections were computed for each star indi- 
vidually to yield heliocentric radial velocities, vhCo that 
we will use for the remainder of this work. 



10 http://astron.berkeley.edu/$\sim$cooper/deep/specld/primer .ht^iRAF is distributed by the National Optical Astronomy Ob- 

11 The pipeline also extracts additional point sources that servatories, which are operated by the Association of Universities 



serendipitously fell on the slit during the exposures. These objects 
are not considered in the present work. 



for Research in Astronomy, Inc., under cooperative agreement with 
the National Science Foundation. 
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The measurement errors on the radial velocities are 
returned by fxcor and are internally computed based on 
the Tonry-Davis i?-value (Tonry & Davis 1979) of the 
cross correlation. Thus our median random velocity error 
amounts to 8.0 kms" 1 . Due to our interactive procedure 
of assessing each spectrum by eye, we were able to discard 
bad spectra and cases in which no correlation peak could 
be discerned on the spot from our sample. These are not 
considered for the remainder of this work. Moreover, red 
background galaxies, which significantly contaminate our 
target sample (of the order of 11% by numbers), were 
identified based on their emission and absorption lines 
and culled from the present sample (see appendix). 

A total of 110 stars were observed on adjacent masks. 
These repeat observations allow us to further assess the 
accuracy of our velocity measurements. We find an over- 
all good agreement between the independent velocity 
measurements of the same stars from different masks: the 
mean deviation is 0.3 kms -1 and the 1 a scatter amounts 
to 8.2 kms -1 , which is consistent with our measurement 

errors. The according reduced \ 2 — V-W J2i ) 

is close to unity. Thus we conclude that our duplicate 
velocity measurements are consistent within the uncer- 
tainties. Moreover, this shows that the formal errors 
returned by fxcor correctly reflect the accuracy of our 
data, also in the light of potential template-target mis- 
matches in stellar type (e.g., Majewski et al. 2004), so 
that there is no need to re-scale these values (cf. Koch 
et al. 2007). As the final velocity for stars with re- 
peat measurements we adopted the error- weighted mean 
of the individual values. As as result, reliable velocities 
could be determined for 2262 of our stars (see Table 2). 

4. MEMBERSHIP SEPARATION 

In order to isolate the true sample of M31 member 
stars from undesired contamination of numerous fore- 
ground Milky Way dwarfs, we utilize the three strongest 
discriminators, viz. V — i' color, the equivalent width 
(EW) of the Na I doublet at AA8189, 8193A, and ra- 
dial velocity. In practice, the EWs of the two Na lines 
were measured by numerically integrating the spectral 
flux within a bandpass from 8179A to 8200A with suit- 
able continuum bandpasses (Gilbert et al. 2006), and 
the errors were obtained by Monte Carlo simulations ac- 
counting for the continuum variance of the spectra. 

To achieve a separation of dwarfs and giants, we fol- 
low Gilbert et al. (2006) in splitting our complete ob- 
served data set into one dwarf- and one giant training 
sample: For this purpose, all stars with radial veloci- 
ties below —200 kms -1 were considered to be M31 red 
giants, whereas those exceeding —50 kms -1 arc most 
likely foreground stars. Since it cannot be assumed a pri- 
ori that the color or Na EW distributions constitute well 
defined analytical, e.g. Gaussian, profiles, we instead de- 
fine empirical probability distributions (PDFs) P from 
the actual observed training samples in color and Na EW 
space. The PDFs were then convolved with the individ- 
ual measurement errors. Likewise, for defining the PDF 
with respect to radial velocity, we adopted a color cut 
at V — i'—2.Q, where stars above this limit are taken to 
be dwarf candidates. In parallel, taking advantage of the 
surface gravity sensitivity of the Na doublet (Schiavon 
et al. 1997), we flagged stars with EWs above 2.5A as 



dwarfs. These empirical limits were chosen because they 
turned out to optimize the separation in the V — i' vs. Na 
EW parameter space. For the M31 giant radial velocity 
distribution, on the other hand, we adopted a one-sided 
Gaussian centered at a systemic velocity of —300 kms -1 
with a radial velocity dispersion of 85 kms -1 , in con- 
cordance with the values of M31's outer halo component 
(e.g., Reitzel & Guhathakurta 2002). All stars with ve- 
locities below —300 kms -1 were assigned a probability 
of unity in the giant training sample. 

Although the number of dwarf stars in this low- velocity 
regime is expected to be well below 1% and their contri- 
bution to the final velocity histograms may be negligible 
(Gilbert et al. 2006), we will turn to a detailed treatment 
of this component in Sect. 4.1. 

Fig. 4 shows the resulting PDFs of the training samples 
as dashed lines. The final probability of any star from our 
sample, with its set of (dhc, V — i' , Na EW), to be either 
a dwarf or a giant was then determined from the PDFs, 
normalized to unity, in each parameter and combining 
these values into a total probability of being a giant: 

fgiant = fgiant(V - %') X Pgiant (Na EW) X P gian t(v H c), 

and likewise for the probability of being a dwarf. If the 
resulting logarithmic likelihood L — log (P g i an t/Pdwarf ) 
is less than zero, the star is considered a dwarf, while 
L > signifies likely giant star candidates (Gilbert et 
al. 2006). The a posteriori histograms in Fig. 4 illus- 
trate our full data set, based on the selection method de- 
scribed above. It is evident that the selected samples in 
all parameters are fully compatible with the pre-defined 
simple training samples and that the number of remain- 
ing dwarf contaminants in our cleaned M31 giant sample 
can be expected to be negligible. There is nonetheless 
a considerable overlap of dwarf and giant stars in color 
space. Moreover, we note the presence of a population of 
stars, flagged as dwarfs based on the full likelihood anal- 
ysis, which exhibit small EWs of the Na doublet. From 
this it becomes obvious that it is in fact necessary to in- 
clude the entire set of available information in color, EW 
and velocity in the analysis to obtain an optimal sepa- 
ration. On the other hand, there will be an inevitable, 
though small, fraction of interlopers that cannot be re- 
liably detected using the traditional separation criteria, 
due to potential covariances between each of the param- 
eters. For instance, we note the presence of one star 
with colors and moderate Na widths representative of a 
dwarf star (as subsequently verified by its spectral fea- 
tures), but with a high negative velocity of —280 kms -1 . 
In this case, the velocity criterion will override the other 
discriminators and lead to classifying this star as an M31 
member. Such cases could be mainly resolved by visual 
inspection of the individual spectra. 

We also note that we refrain in our dwarf/giant sep- 
aration from adopting additional secondary parameters 
such as weaker spectral features (e.g., K I EWs, TiO 
bands, embedded within telluric absorbtion bands) to 
avoid adding further noise to the final PDFs. Further- 
more, we did not weight our combined likelihood by the 
number of available diagnostics to account for potential 
outliers in either of the parameters, thereby yielding a 
statistically more robust and uniform rejection of dwarfs 
(cf. Gilbert et al. 2006). 

To assess the accuracy and efficiency of the separa- 
tion methods described above, we ran a comprehensive 
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suite of Monte Carlo simulations. To this end, each of 
the three indicators was varied 10 times by its measure- 
ment uncertainty, from which subsequently new PDFs 
were built and each target's dwarf/giant status was re- 
determined. As a result, 92% of those stars previously 
classified as giants were still classified as such in 95% 
(2a) of the Monte Carlo realizations. Thus we are con- 
vinced to have obtained a solid dwarf /giant separation of 
our sample. In particular, none of the results obtained 
in this work changes significantly, whether "all" giants 
are included or only those with secure, 2a -classifications. 
The final number of giant candidates per field is listed 
in Table 2. The ratio of giant to dwarf stars decreases 
with increasing radial distance from M31 (Fig. 4, bottom 
right), as expected as the M31 halo density levels off, un- 
til the halo of M33 contributes giants in the outermost 
fields. 

There is still a non-negligible fraction of stars classified 
as giants present in a transitional region around —150 
kins" 1 (~ 1.7 a above the systemic velocity), which pre- 
vails in the outermost minor axis fields (Sect. 6.2). An- 
other noteworthy outcome of our dwarf removal is that 
the addition of velocity as a membership criterion ef- 
fectively deprives the sample of red giants above —100 
kms -1 . This limit already corresponds to removing stars 
that deviate by more than approximately 2.3 a from the 
sample, and we do not expect the presence of any ma- 
jor population of M31 stars in this high velocity regime. 
Considering the limited S /N of the spectra in our dataset 
as well as that of the photometry, we have concluded 
that it is essential to use radial velocity as a dwarf/giant 
separation criterion. As a result, the velocities of our 
dwarf-cleaned sample span a full range from —570 to 
— 100 krns" 1 . Additionally, the constancy of the fore- 
ground Galactic dwarf sample's velocity and dispersion 
(Fig. 11, bottom left) over the entirety of the M31 halo 
strengthens further our case for efficient dwarf/giant sep- 
aration. 

In a recent work, Sherwin et al. (2008) predicted that 
a total number of ~5 hypervelocity stars with velocities 
below —420 kms -1 should be identifiable in M31's halo. 
However, given M31's large overall velocity dispersion 
and the low number of stars with the highest negative 
velocities, it is impossible to resolve, whether any of those 
stars in our sample arc in fact ejected from the center of 
M31, or if they are canonical (2a-) members of its genuine 
halo. 

4.1. Comparison with the Besangon model 

As Fig. 3 shows, there are still a number of stars 
present bluewards of the most metal poor isochrone that 
were classified as M31 giants based on all separators. 
However, it cannot be excluded that a subset of these 
may be blue, metal poor Galactic halo dwarfs, which typ- 
ically have negligible Na doublet lines and a broad range 
in radial velocities (Fig. 5). Thus these contaminants are 
indistinguishable from the giant sample and their sepa- 
ration is insoluble based on the canonical membership 
criteria. It is important to assess the fraction of these 
blue stars, since their systematically weaker CaT lines 
will yield falsified, low metallicities. 

To this end, we queried models of the Besangon Galac- 
tic foreground population (Robin et al. 2003) using color 
cuts and spatial locations in analogy to the observed sam- 



ples. The resulting distribution for the outermost field 
at 160 kpc is illustrated in Fig. 5 (right panel). We chose 
to exemplary plot this outermost field, because it is the 
one for which the Milky Way contamination is expected 
to be the highest. The first thing to note is that there 
is in fact a non-zero population of Galactic stars pre- 
dicted at M31's systemic velocity, with radial velocities 
as low as —420 kins -1 (although we note that the ve- 
locity dispersion in the Galactic model may in fact be 
overestimated). Most of the contaminants are, however, 
distinguishable, either by their high velocities or their 
redder colors. We thus estimate the number of unde- 
tectable blue stars in our sample by determining the pre- 
dicted fraction of dwarfs with vhc —150 kms -1 and 
V — i' < 1 to those dwarf stars in the color- velocity space 
that we are able to distinguish (Fig. 4; Sect. 4). Multiply- 
ing this fraction with the number of our observed giant 
candidates (Tabic 2) then shows that there are typically 
no more than 0-2 blue dwarf stars to be expected per 
field in our giant sample that cannot separated by any 
of the observable criteria (see also Fig. 5, right panel), 
leading to a total predicted number of 35 such contami- 
nants in the entire sample. If present, these will have a 
negligible effect on the more populous, true giant sample. 

Martin et al. (2007) model the CMD of these fields, 
reaching fainter than the limit of our spectroscopy, and 
find no evidence for excess star counts of Galactic mem- 
bers. In principle, dwarf members of the Andromeda- 
Triangulum stream or a potential contaminant may be 
present, especially in the M31 giant-poor outer fields. 
Rocha-Pinto et al. (2004) have measured radial veloc- 
ities of Andromeda- Triangulum stream members, find- 
ing one star with —245 kms -1 , but with most stars 
at higher velocity. Furthermore, the Monoceros Ring, 
whose main sequence stars can overlap with M31 stars 
near the Tip RGB, have a radial velocity that is high 
enough (at —75 kms -1 with a dispersion of 26 kms -1 ; 
Martin et al. 2006a) so that they are not an issue in the 
analysis. Based on these studies, neither of the systems 
poses any risk for contaminating the field with stars at 
the radial velocity of M31. 

5. A NEW METHOD FOR CALCIUM TRIPLET 
METALLICITIES 

In the frequent cases of low spectral resolution and/or 
low S/N ratios the last resort is to measure gravity 
and / or abundance-sensitive indices in band passes a few 
times the spectral resolution, which are then calibrated 
theoretically via a grid of synthetic spectra (e.g., Jones et 
al. 1996). The latter is in general a critical endeavor for 
the CaT, since these lines are formed in the upper chro- 
mospheres of the stars, which ideally requires a full non- 
local thermodynamic equilibrium treatment (Smith & 
Drake 1990; J0rgensenet al. 1992). Detailed model com- 
putations are however sparse so that present-day studies 
of stellar populations mostly rely on empirical calibra- 
tions (Cenarro et al. 2001; and references therein). 

Canonically, the line strength of the near-infrared CaT, 
EFT, has been defined by a weighted sum of the three 
individual lines' pseudo equivalent widths, EW;: 

i=l i=l JBPi \ c.i J 

where F denotes the flux in a predefined set of line band 
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passes (BPi) and F c is the continuum level as determined 
in a set of continuum bands. There is no physical moti- 
vation to prefer any set of the weight factors Wi over the 
other, as long as a consistent definition is used between 
the target red giants to be calibrated and those in the 
calibrator systems, i.e., the high S/N Galactic globular 
cluster spectra. The choice of the weights is mostly gov- 
erned by the spectral quality and measurability of each 
of the Ca lines. In this vein, the most frequently used 
weights throughout the literature are (wi = l Vi; e.g., Ar- 
mandroff & Zinn 1988), (wi = 0, w 2 = 1, w 3 = 1; e.g., 
Armandroff & DaCosta 1991) and (wi — 0.5, W2 = 1, 
w 3 = 0.6; e.g., Rutledge et al. 1997a). The EWs are 
then either determined by numerically integrating the 
spectral flux over the full bandpass or by fitting an ana- 
lytical function F(X) to the line profile. However, both 
methods tend to fail at the lowest S/N ratios, where 
a pure numerical integration merely reflects the noise of 
the spectrum rather than the actual EWs, while the lines 
cannot be reliably fit anymore in the low S/N regime. 
Typically, the limiting S/N for which CaT based [Fe/H] 
measurements are given in the literature lies at 10-15. 

At the low S/N ratios of our faintest DEIMOS tar- 
gets, which reach as low reliable determination 
of the CaT EWs is not feasible. Moreover, the presence 
of sky line residuals around the CaT often leads to artifi- 
cially increased widths if uncritically integrated over the 
respective band passes. In particular, the third of the 
CaT lines at 8662A is susceptible to this increased noise 
component. 

Hence, in order to enhance the S/N in the CaT of each 
individual spectrum, we define a coadded line strength 
<T,W> in that we interchange the order of summation 
and integration in eq. 3: 

<XW>= [ ][> (l- F( V V0 ) d(X-Xo,i). 

J<BP> i=1 \ t c,i J 

That is, each line center is shifted towards a zero wave- 
length before performing a weighted coaddition of the 
three lines in this rest frame, using the identical weight 
factors as in the canonical definition. Mathematically, 
this expression is fully equivalent to the traditional def- 
inition in eq. 3, but it provides the advantage of inte- 
grating the coadded flux, resulting in an increased effec- 
tive S/N. The integration is then carried out over a 
single common band pass. This procedure strictly pre- 
supposes that the individual band passes BPi all have 
the same width and location relative to the line center so 
that BPi — Xi is the same for each of the lines. However, 
we will tie our measurements to a metallicity reference 
scale by using our own suite of measurements in Galactic 
globular clusters. Thus possible differences in the band 
passes will affect the reference spectra and our target 
spectra in the same manner and these will not introduce 
any systematic bias. In the following, we will follow the 
prescription of Rutledge et al. (1997a) by adopting the 
line weights of 0.5, 1 and 0.6. Our coaddition method 
will improve the effective S/N in the CaT region by a 
factor of y/^/Wi, or 1.45, so that we will be capable of 
measuring metallicities even with S/N as low as ~7-10. 
Furthermore, the coadded line is more robust against po- 
tential sky residuals and noise spikes, enabling us to fit 



a line profile (see the illustration in Fig. 6). In prac- 
tice, we fit the resulting line with a Penny function, i.e., 
a Gaussian plus a Lorentz component, which has proven 
to provide the best representation of the line wings (Cole 
et al. 2004). 

In order to tie our CaT metallicity measurements in the 
M31 stars to a reference scale, we performed the identi- 
cal coaddition technique as described above on a sam- 
ple of globular clusters of known metallicity and thus 
derived the CaT line strengths of the globular cluster 
stars. Since no observations of globular cluster stan- 
dard stars were taken for the present project using the 
same instrumental set up as for the M31 science obser- 
vations, we exploited the data set of Koch ct al. (2006), 
which consists of high S/N spectra of 80 red giants in 
four Galactic clusters, NGC3201, 4590, 4147 and 5904, 
obtained with the FLAMES spectrograph at ESO/VLT 
(Pasquini et al. 2002). Since the FLAMES spectrograph 
provides a higher spectral resolution than DEIMOS, we 
degraded the spectral resolution of the FLAMES spec- 
tra to match that of DEIMOS. The final calibration of 
the line strengths onto metallicity was then achieved ac- 
counting for the stars' magnitude above the horizontal 
branch (HB; e.g., Rutledge et al. 1997a, b), where we 
find the following relations 

W' = < T,W> + 0.55 (V -V KB ) (5) 
[Fe/H] CaT = —2.90 + 0.45 W' (6) 

with an r.m.s. scatter of 0.02 dex on the [Fe/H]c a T cali- 
bration of eq. 6. These relations from the FLAMES data 
are shown in Fig. 7. In this calibration we explicitly 
adopted the globular cluster metallicity scale of Carretta 
& Gratton (1997). Furthermore, we assumed a HB ap- 
parent magnitude of 25. 17 mag for the M31 stars, which 
corresponds to the mean magnitude of its halo red HB 
population (Holland et al. 1996). The extent of M31's 
halo will inevitably lead to a spread of distances along 
the sight so that the adoption of a single HB magnitude 
is a simplifying assumption. Ibata et al. (2007) esti- 
mate that, for an extended p(r) oc r~ 2 - 9 density profile, 
the variation in the distance modulus is typically less 
than 0.5 mag. Translating this into a HB spread and 
by applying our calibrations (Eqs. 5,6) this leads to an 
uncertainty of 0.12 dex on the spectroscopic metallici- 
ties. Given the a priori unknown distance of individual 
red giants, this source of uncertainty cannot be elimi- 
nated and will affect all spectroscopic metallicity mea- 
surements in M31. Furthermore, the adoption of the 
HB of the oldest stars will introduce a systematic effect 
on spectroscopic [Fe/H] estimates in the presence of a 
non-negligible intermediate-age population. Koch et al. 
(2006) estimate that the simplification of a single-age 
HB results in in a bias of the order of —0.1 dex, that 
is, the intermediate-age stars may be 0.1 dex too metal 
poor. This is the same order of magnitude that Cole et 
al. (2004) find from their cluster sample, which includes 
a number of young open clusters. Of course, one cannot 
assess which stars are affected to what extent, since their 
ages cannot be assigned a priori. In this context, we note 
that also the M31 globular clusters show a mild trend 
in mean M^HB) vs. [Fe/H]; this may be lessened by 
the brighter HBs of intermediate age populations. This 
trend is 0.4 mag but has a full width of 0.6 mag (Rich et 
al. 2005, their Fig. 12), with a lcr scatter of 0.2 dex. This 
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compares to a small formal random uncertainty of typi- 
cally less than 0.01 dex on [Fe/H]caT that is introduced 
by the photometric errors through Eqs. 5,6. Even an un- 
realistically large error of lmag on a star's V magnitude 
would result in a metallicity error of 0.23 dex. 

We note that the coefficients of our relations in Eqs. 
5,6 are slightly different from the standard calibration of 
Rutledge et al. (1997b), but for the sake of consistency 
between our own measurements and our own suite of 
calibration clusters, we will use these relations for the 
remainder of this work (see also the discussions in Koch 
et al. 2006). 

In order to estimate the random and systematic mea- 
surement uncertainties on our derived coadded CaT 
metallicity, we performed a set of Monte Carlo simula- 
tions (e.g., Simon & Geha 2007), accounting for three 
effects. For this purpose, we added artificial Poisson 
noise to theoretical spectra, consisting of three Penny 
lines with line strengths representative of typical red gi- 
ants as expected in our M31 sample. This noise addition 
accounts for the spectral quality in terms of the S/N 
ratio and the variance of the continuum. Secondly, we 
also added additional random noise peaks and troughs 
on top of the individual CaT lines, thus simulating po- 
tentially bad sky subtraction residuals that hamper CaT 
measurements. Even if one were to assume that the in- 
dividual line profiles were perfect Penny functions, the 
(weighted) sum will not necessary be a Penny anymore. 
By thus measuring the simulated coadded spectrum and 
comparing the obtained line strength with the traditional 
strength of the individual theoretical line profiles in the 
unperturbed spectrum according to eq. 3, we can finally 
estimate the influence of random noise, residual noise 
peaks and deviations from the assumed analytical line 
profile. In this vein, we adopt the standard deviation of 
the difference of the measured <T,W> in the noise added 
spectrum and the T,W of the input spectrum as a mea- 
sure of the measurement uncertainty as a function of the 
spectral S/N ratio and continuum variance. Thus Fig. 8 
depicts the resulting relative uncertainty estimates. Typ- 
ically, the widths measurements are accurate to the 15% 
level at a S/N of 10 and the relative error on the widths, 
a<Y,W> / <Y,W> , generally does not exceed 25% at the 
spectral quality of our DEIMOS data. Applying our cal- 
ibration from above (Eqs. 5,6), this translates into typ- 
ical metallicity errors of (0.32,0.25,0.2) dex at S/N of 
(5,10,20). As another potential source of uncertainty, we 
estimate that a conservative continuum placement error 
of 2% will result in metallicity errors less than 0.1 dex. 

Spectra with too low a S/N ratio to even measure the 
coadded line strengths were discarded from our sample 
on the spot so that we end up with a total number of 
1673 CaT measurements (Table 2). An aggravating fac- 
tor for using the CaT as a proxy for metallicity is the 
onset of strong TiO absorption in the coolest stars. In- 
creasing strength of the TiO band at 8500A progressively 
depresses the CaT lines for redder stars so that the line 
strength T,W starts to turn over towards lower values 
for colors redder than V — i' > 2 (e.g., Garnavich et al. 
1994; their Fig. 13). This ambiguity, which also reflects 
in an overturn of the RGBs of metal rich globular clus- 
ters, prompted us to only include stars with V — %' < 2 
in the present chemical analysis. In this way we avoid 
an underestimate of the metallicities of the reddest stars. 



Thus 64 giant stars with velocities compatible with the 
M31 mean were removed from the CaT sample. We note 
that most of the reddest, nominally most metal poor 
stars that were thus rejected, are mostly found within 
~40 kpc so that they do not affect our later conclusions 
about the overall large-scale radial metallicity distribu- 
tions. 

5.1. On photometric metallicities 

As a secondary estimate, we derived photometric 
metallicities of our stars by following isochrone fits laid 
out in Reitzel & Guhathakurta (2002). For this purpose 
we adopted individual corrections for reddening from the 
Schlcgcl et al. (1998) maps. In practice, we fit a set of 
isochrones to our CMD (Fig. 3), where we employed the 
stellar tracks without a- enhancement from the Padova 
group (Marigo et al. 2008). Our choice to negelect a 
enhancement is justified by the age range in M31's halo 
(Brown et al. 2006): Stellar populations with a wide age 
range have had sufficient time for supernovae of type I to 
contribute iron and therefore the composition trends to- 
ward Solar. Each star's locus in the CMD was then fit by 
a surface, using a 12th order function in both g' — i' and 
i', with full cross terms, to the metallicity. In practice, 
we generated a grid of isochrones from the group's web- 
interface 13 , covering 0.0001 < Z < 0.03 with a spacing 
of 0.001. Moreover, we adopted the set corresponding 
to a population of 12.7 Gyr age, in concordance with the 
oldest populations found in the M31 halo (but see also 
Brown et al. 2003, 2006, 2007). 

As the comparison with our CaT measurements in 
Fig. 9 indicates, the dwarf stars clearly deviate from 
unity, while the metallicities from both methods agree 
better for the giant candidates, albeit with a broad scat- 
ter (see also Kalirai et al. 2006a). The overall agreement 
within this scatter is notable down to the lowest metal- 
licities. We will exclusively rely on our spectroscopic 
metallicities throughout this work, since the photomet- 
ric metallicity estimates are generally prone to a number 
of inconsistencies: Among these are the strong influence 
of photometric uncertainties. The median error on our 
colors is 0.05 mag, but it reaches ~ 0.15 mag for a num- 
ber of our targets. For the more metal rich stars, the 
formal metallicity errors due to photometric errors are 
below 0.05 dex, but they exceed 0.1 dex for stars bluer 
than a g' — i' of ~1.5 and can reach as high as 0.5-1 dex 
for the bluest giants. In this locus, the more metal poor 
isochrones become more and more degenerate and do not 
permit a reliable metallicity determination (see Fig. 3). 
In contrast, the CaT method has its greatest sensitivity 
at the metal poor end. 

For populations with [Fe/H]> —1 dex, the red giant 
branch curves both red and faint in a complex way, due 
to the onset of TiO absorption in the coolest stars (e.g., 
Garnavich et al. 1994). This behavior is not well mod- 
eled and certainly affected by a- enhancement. The as- 
signment of metallicity becomes a complex function of 
both color and magnitude. Moreover, the systematic 
uncertainties on the photometric metallicities will be in- 
evitably large, given the a priori unknown age- and abun- 
dance distribution of our stellar sample that is drawn 
from a patchwork of M31's populations. In this con- 

13 http: //stev . oapd. inaf . it/$\sim$lgirardi/cgi-bin/cmd 
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text, the presence of a broad distribution in stellar ages 
has been confirmed within the inner spheroid fields by 
the deep HST/ACS based CMDs of Brown ct al. (2003, 
2006, 2007), where in fact 30% of the stars in minor axis 
fields at 11 and 21 kpc were found to be 6-8 Gyr old. 

Likewise, the unknown effects of varying distance mod- 
ulus (Ibata et al. 2007) can easily move stars between dif- 
ferent isochrones and pose an additional source of uncer- 
tainty. A plausible distance modulus spread of 0.5 mag 
translates into a 0.12 dex variation in the spectroscopic 
mctallicities. This uncertainty in the stellar magnitudes 
will have, however, much larger effect on the photomet- 
ric values. We will return to the issue of photometric 
metallicities in Sect. 7.1. 

6. VELOCITY SUBSTRUCTURES 

We shall now turn to the analysis of the dwarf cleaned 
M31 velocity distribution. For this purpose, we compare 
our observed data to a new suite of TV-body simulations. 

6.1. Simulating satellite accretion into M31's halo 

In a new set of TV-body simulations by Mori & Rich 
(2008) a satellite galaxy is accreted onto M31, for which 
the self-consistent potential of Widrow et al. (2003; 
their model A) was used. This potential is essentially 
a three component disk/bulge/halo model, the param- 
eters of which were optimized to match M31's rotation 
curve and its surface brightness and velocity dispersion 
profiles. 

Mori & Rich (2008) adopt the orbit of Fardal et al. 
(2006), which was originally constrained to reproduce the 
stellar distribution of the observed giant stream. For the 
accreted galaxy itself a total mass of 10 9 Mq distributed 
as a Plummer sphere with a scale radius of 1 kpc was 
used. The progenitor mass can in general be well con- 
strained by means of the observed thickness of the M31 
disk (at 2^=300 pc; Kuijken & Dubinski 1995) - exces- 
sive masses would have led to an early destruction of the 
thin disk component as we observe it today. In practice, 
the disk was represented by 7.4xl0 6 particles with a to- 
tal mass of 7x 10 10 M Q and a scale length of 5.4 kpc. The 
bulge mass was assumed as 2.5xl0 10 M© (2.6 xlO 6 parti- 
cles), whilst the halo contained 3.2xlO n M (30.8 xlO 6 
particles) with a tidal radius of 80 kpc. Yet, there is 
generally a significant lack of knowledge about the outer 
density profile of the dark matter halo. Therefore, one 
cannot avoid ambiguities in estimating the tidal radius 
and the total halo mass. In particular, the model of Mori 
& Rich (2008) has a smaller radius and mass compared to 
those of earlier studies (cf. 8.8xl0 n M Q and a radius of 
195 kpc; Fardal et al. 2007). However, since the satellite 
orbit is mainly followed within 50 kpc from the center of 
M31, the outer structure of the dark matter halo is of 
little relevance to the dynamics of the satellite. 

In fact, the Mori & Rich (2008) simulation gives results 
similar to those of Fardal et al. (2007) who have a more 
massive dark matter halo, but which otherwise have the 
same satellite orbit and position (by construction) and 
similar disk and bulge mass. 

Apart from the vastly increased number of particles in 
the work of Mori & Rich (2008), the novelty of these sim- 
ulations is the use of a self-gravitating, live disk and bulge 
that respond to the actual infall by particle motions of 
the underlying components. Thus we cannot only trace 



stars that were torn from the satellite in the course of the 
accretion, but also follow the fate of mutually removed 
disk and bulge stars. Hence we may assess how ejected 
disk stars might contribute to the halo in comparison to 
stars originating in the disrupted satellite. Another im- 
portant feature of including the live disk and halo is that 
an accounting can be made of energy input into these 
populations from the collision event. This aspect affects 
the kinematics of the satellite's stars following the colli- 
sion. 

Projections of the simulation at t = 1 Gyr in the stan- 
dard M31 coordinate system are shown in Fig. 10, sepa- 
rately for disk, halo and satellite particles, as well as the 
combined simulation data. As these plots show, the main 
features observed in surface density maps of M31's halo, 
such as the Giant Stellar Stream (Ibata et al. 2001) and 
the bubble-like feature in extension of the stream (the 
Eastern and Western shelves; Ferguson et al. 2002, 2005; 
Irwin et al. 2005; Ibata et al. 2007) are well reproduced 
by the simulations. 

6.2. Minor axis fields 

The results from our simulations are shown in Fig. 11, 
in comparison with the observed distribution of stars on 
the minor axis in the analogous velocity vs. location 
space. It is evident that the simulations yield a charac- 
teristic triangular shape of the satellite particles' veloc- 
ity as a function of projected distance, indicative of its 
disruption during the accretion (e.g., Merrifield & Kui- 
jken 1998). This feature is also discernible in our ob- 
served data (top panel), though to a lesser extent, and 
is consistent with the findings of kincmatically cold, i.e., 
low-radial velocity dispersion, localized substructure in 
Gilbert et al. (2007). While the simulations predict the 
maximum density of this shell-like feature to occur at a 
projected distance of ~ 1.4° (19 kpc), the observations 
indicate an onset of this cold structure slightly inwards, 
at ~ 1.1° (15-16 kpc). Using the Kayes Mixture Model- 
ing algorithm of Ashman et al. (1994), we decomposed 
the (unbinned) velocity data at this location into two 
Gaussian components, thereby verifying the likely pres- 
ence of a considerably colder substructure with a radial 
velocity dispersion of (29±22) kms -1 plus an underlying 
canonical halo component with a dispersion of (110±10) 
kms -1 . The population ratios, by number, of these 
components are 24% vs. 76%, in good agreement with 
Gilbert ct al. (2007) . It has been suggested by Gilbert et 
al. (2007) and Fardal et al. (2007) that such kinematic 
substructure is probably a realization of spatially local- 
ized shelves, which may provide the forward continuation 
of the giant stream. Other kinematic substructures other 
than the distinct V-shape in our data towards the inner 
spheroid are less conspicuous and the inner regions in 
our observations appear smoother compared to both the 
simulations and in the sample of Gilbert et al. (2007). 

6.3. Outer halo fields 

Although the radial scale in Figs. 11 and 12 (top panel) 
was chosen so as to emphasize the substructure and 
build-up of the inner halo within <~ 35 kpc, we note that 
we detect stars that were flagged as genuine M31 red gi- 
ants out to large radial distances of 160 kpc (see Fig. 12, 
bottom panel), thus strengthening the claim for the large 
radial extent of M31's stellar halo by Guhathakurta et 
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al. (2006) and Kalirai et al. (2006a). Nevertheless, the 
number density of confirmed members at large radii is 
sparse (these are addressed in further detail in Fig. 19). 
The outermost field at 160 kpc (mil) contains 16 giant 
candidates with a mean velocity of —180 kms -1 , nine 
of which lie above —200 kms -1 and seven with veloci- 
ties in excess of —150 kms -1 . In the adjacent, second 
most remote field (m8), seven stars survived the criteria 
for being selected as giants, although only one exhibits 
a radial velocity close to M31's systemic mean, while 
the remainder shows velocities between —180 and —120 
kms -1 , thus about of the order of 1.2-1.8<7 above M31's 
mean velocity. While we cannot exclude the possibility 
that these stars are in fact members of M31's outermost, 
extended halo, we labeled their mean velocities as upper 
limits in Fig. 12 (bottom). All in all, there is a progres- 
sively larger relative contribution of stars at higher ve- 
locities with respect to M31's mean towards outer fields. 
Based on their star count maps, Ibata et al. (2007) esti- 
mate that the field covered by our four mil masks should 
contain 0-2 M31 red giants. In fact Kalirai et al. (2006a) 
report on the presence of 3 bona fide giants in this field. 
Given the location of this field at a separation of 4° (~ 
50 kpc) of M33, it is then conceivable that the stars to- 
wards the higher velocity tail are part of the overlapping, 
extended halo component of M33, as suggested by Ibata 
et al. (2007). With its distance from the Sun of 849 kpc 
(Galleti et al. 2004) , M33 is located at a distance of 220 
kpc from M31 (e.g., Koch & Grebel 2006) and its sys- 
temic velocity lies at —180 kms -1 (e.g., McConnachie et 
al. 2006). Judging by the surface brightness profiles of 
Ibata et al. (2007; e.g. their Fig. 28), the contribution of 
M33 stars (coupled with the inevitable foreground com- 
ponent) appears to set in at radial distances > 11° (~150 
kpc). 

6.4. The halo 's merger origin 

It is then intriguing to ask, to what extent the full halo 
of M31 has been assembled via the accretion of one or 
more accretion events such as the one simulated in Mori 
& Rich (2008). Is there a necessity to invoke more such 
mergers or is there evidence of several disruptive events 
involving many smaller satellite systems? In fact, Ibata 
et al. (2007) have revealed a wealth of substructures and 
stellar streams that haunt the full extent of M31's halo, 
thereby complicating the interpretation of this halo as a 
single, smooth entity. Therefore, we show in Fig. 13 our 
observed radial velocity distributions against the simu- 
lated ones as a function of radial distance. The simu- 
lation particles shown were selected from locations that 
coincide with our observed fields, but we inflated the re- 
spective selection boxes, where necessary, so as to ensure 
the same number of observed and simulated stars for the 
comparison. 

The first thing to note is the presence of two major 
velocity peaks in the simulated data, which reflects the 
wrap of the material stripped from the disrupted satel- 
lite galaxy around the M31 disk, as is in fact observed 
in the form of the Giant Stellar Stream. The regions 
of the aforementioned cold substructure are contained 
within the R = 17 kpc histogram, yet there is no clear re- 
semblance between observations and simulations in this 
regard. While the simulations still show the apparent 
kinematic bifurcation with pronounced peaks at —400 



and —200 kms , the observations indicate the narrow 
population peaking at M31's systemic velocity of —300 
kms -1 , with an underlying, smoother and broader gen- 
uine halo population. To this end, we also plot in Fig. 13 
(dashed lines) the contribution of original M31 halo stars 
from our simulation (again selected from the same spa- 
tial locations as above), i.e., those, which are not re- 
lated to the disrupting satellite in any way. It is striking 
that a major part of our velocity histograms at all radii 
closely resembles this genuine halo component, and that 
the strongest deviations of the velocity distributions from 
the simulations occur towards the fields at approximately 
13-17 kpc. Furthermore, there is a clear discrepancy be- 
tween the model predictions and the observed distribu- 
tion in the innermost spheroid (<13 kpc). This kinematic 
evidence suggests that the inner spheroid of M31 cannot 
entirely consist of debris from one collision like the one 
responsible for the Giant Stellar Stream. In fact most of 
the observations at all radii appear to strongly overlap 
with the prevalent M31 halo component, again showing 
that the accretion of a massive satellite galaxy did pre- 
sumably not single-handedly drive the build-up of the 
inner halo of M31. 

Moreover, the discovery of at least 4 major streams 
perpendicular to M31's minor axis, which also intersect 
our observed fields (Ibata et al. 2007) clearly shows that 
essentially all of our targeted fields are contaminated by 
a more or less significant fraction of stars originating in 
the mergers of the systems responsible for these streams 
(Chapman et al. 2008). Although the exact shape of 
the simulations' velocity distribution depends on the ex- 
tensive parameter space of progenitor structure, its kine- 
matics and orbit, the overall trends and discrepancies be- 
tween model and observations discussed above will not 
sensitively rely on such parameter variations and are ex- 
pected to persist, in particular, since the Mori & Rich 
(2008) model reproduces the structural features in the 
halo observed in the surface brightness maps remarkably 
well. 

The observed velocity distributions (Figs. 12,13) in the 
Giant Stream fields H13s and a3 (at 21 and 32 kpc) show 
the clear signatures at —520 and —400 kms -1 (H13s) 
and —450 kms -1 (a3). In accordance with Kalirai et al. 
(2006b) we find the H13s peak at -520 kms" 1 to be more 
prominent relative to the one at higher velocities. It is 
then noteworthy that, while well predicting the Stream 
component at —400 kms -1 at ~17 kpc (bottom left panel 
of Fig. 13), our simulations and the models of Fardal 
et al. (2006, 2007), on which our orbit is based, fail 
to reproduce the primary, low-velocity peak prominently 
seen in the 22 kpc histogram, as well as the outer stream 
field in the data 30 kpc. This feature is, however, well 
reproduced by the simple orbit model by Ibata et al. 
(2004). 

Interestingly, neither disk, nor bulge stars, ejected dur- 
ing the simulated merger event contribute any consid- 
erable fraction to any of the potential substructures in 
the velocity histograms. Only a few of these ejected 
disk stars are to be found within the innermost 10 kpc: 
The number ratios of model particles within a region 
corresponding to the observed field fl09 are 0:2:280:1 
(bulge/disk/halo/satellite). None of the disk particles 
ventures any further out. At a total of 158 giant stars 
in this field, we would expect no more than 1-2 stars 
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ejected from the M31 disk. 

7. ABUNDANCE SUBSTRUCTURES - A STRONG 
GRADIENT 

In Fig. 14 we show our metallicity results from the new 
coaddition measurements (Sect. 5) as a function of radial 
velocity and radial distance. It is worth noticing that the 
distribution of these spectroscopic metallicities with ve- 
locity (left panels) appears to be a powerful dwarf/giant 
discriminator that can be efficiently employed in cases, 
where not all of the traditional indicators are available 
(cf. Sect. 4). In this representation, the dwarf stars 
occupy a narrow range above > —150 kms" 1 around 
"[Fe/H]"~ —2 dex. Given the lack of knowledge of their 
distances, their assignment relative to a HB magnitude 
becomes meaningless. It is also far from self-evident that 
the CaT linestrength correlates with metallicity in the 
dwarf stars as in the giants. For this reason, the tradi- 
tional calibrations (Rutlcdge ct al. 1997a, 1997b; eqs. 
5,6) are not valid for dwarfs anymore and their applica- 
tion leads to their clear separation in Fig. 14. 

Contrary to the strong clumping in velocity space 
(Figs. 11,12), there is no apparent population substruc- 
ture discernible within the inner fields of R <20 kpc, 
nor in the Stream fields at 21 and 32 kpc. More strik- 
ing is the sudden decrease in the mean metallicity to- 
wards larger projected radii that becomes already visible 
in this representation. It is at these distances, where 
a transition from the dominant, more metal rich bulge 
population towards a more metal poor halo component 
may occur (Ostheimer 2003; Kalirai et al. 2006a). This 
view would also conform with the claim of a break in 
M31's surface brightness profile between an inner i? 1 / 4 
profile (de Vaucouleurs 1958) and a gradual transition to 
an R~ 2 power-law surface brightness profile that defines 
an outer halo (Irwin et al. 2005; Guhathakurta et al. 
2006; Chapman et al. 2006). Interestingly, the run of 
the strength of the TiO band 14 at 7100A (top left panel 
of Fig. 15) lends strong support to this scenario. While a 
number of TiO-strong, thus more metal rich, giants are 
predominant in the bulge and inner halo regions, these 
are clearly missing in the outer parts beyond ^50 kpc. 

In Fig. 15 (top right panel) we separate our full data 
set into three radial bins, comprising the inner spheroid 
(R <20 kpc), the transitional region (20-40 kpc) and the 
outer halo fields from 40 kpc out to our last data at 160 
kpc. There is a clear indication that the outer fields are 
distinctly more metal poor by more than 1 dex compared 
to the inner spheroid: while the inner 20 kpe's metallicity 
distribution function (MDF) peaks at ~ —0.6 dex, the 
mean metallicity shifts progressively towards ~ —1.2 dex 
for 20 kpc< R <40 kpc. This region already contains a 
considerable component below —2 dex, which becomes 
the characteristic metallicity regime in the outermost 
fields. A K-S test reveals that the MDFs from all three 
regions arc unlikely to originate from the same parent 
distribution, with all probabilities being consistent with 
zero. This suggests that there is in fact a mixture of pop- 
ulations, possibly from the inner spheroid and an outer 

14 The Ti07ioo band pass is a classical discriminator of spectral 
types (O'Conncll 1973) and we measure its strength by a straight 
integration of the line band pass from 7055— 7245A with respective 
continuum bands. Larger values of TiOnoo indicate the presence 
of cool, metal rich giants. 



halo component. Nevertheless, we note that the disper- 
sion of the MDF (as derived from an iterative Gaussian 
likelihood estimator accounting for measurement errors; 
Koch et al. 2007) docs not increase considerably due 
to the potential overlap of two populations of separate 
peak metallicities: at (0.47±0.02) and (0.46±0.03) the 
dispersions are practically indistinguishable. However, it 
is perilous to compare these numbers, when splitting the 
sample in only these two regions along a broad spatial 
range. 

Thus we plot in the middle panel of Fig. 16 the disper- 
sion in metallicities versus radius in smaller radial bins 
that were, for R<40 kpc, chosen such as to maintain the 
same number of stars (viz. 100). This way, we guarantee 
a proper statistical sampling; furthermore, since the data 
were analyzed in a homogeneous manner, no systematic 
biasses will be introduced when averaging data across 
adjacent masks. Only for the three outermost fields be- 
yond 85 kpc do we group the sparse data by field (11, 5, 
and 10 giants in m6, m8, and mil) to avoid averaging 
across the large radial gap. These fields are separated by 
40 kpc (already the size of the entire inner spheroid) and 
it is further likely that these fields are contaminated by 
both substructure and the halo of M33. 

The bin at 16 kpc shows a metallicity dispersion that 
is higher by 0.24 dex than the average, while the adja- 
cent bin at ^18 kpc again exhibits an average disper- 
sion. It is worth noticing that the dispersion in our data 
around 13-14 kpc shows a remarkably smaller internal 
(i.e., accounting for measurement errors) dispersion in 
[Fe/H]caT of a = 0.21 ± 0.03 dex (at a mean metallicity 
of —0.84 ± 0.07 dex). In fact, this region corresponds 
to the approximate location of the kinematic substruc- 
ture discussed in the previous section and by Gilbert et 
al. (2007). Without applying any further velocity cuts 
to extract substructure stars by isolating their triangular 
distribution in velocity space, these regions are expected 
to contain an overlap of giants from the intrinsic inner 
spheroid population and those forming the substructure 
itself. As Gilbert et al. (2007) argue, the resulting, in- 
termixed MDF should be expected to be slightly more 
metal rich than adjacent other fields. The radial bin in 
question is slightly more metal rich on average by ~0.11 
dex than the next inner data (at a significance of 0.9a). 
Hence, our mean abundances in the inner spheroid and 
the substructure do not lend strong support to the cold 
structure being significantly more metal rich than its sur- 
roundings. 

However, the mean [Fe/H]c a T significantly drops by 
about 0.4 dex (2.5a) between approximately 16 and 20 
kpc (Fig. 16), while this trend even proceeds further out- 
wards. We note that this sharp drop coincides with the 
edge of disturbed region visible in the Ferguson et al. 
(2002) maps. As Ibata et al. (2005) show, the majority 
of the photometric contrast in the Ferguson et al. (2002) 
maps, leaving the impression of an edge, is due to an 
extended, rotating disk component (see also Fig. 1, top 
panel). As this structure has a high [Fe/H], the transi- 
tion that we map in the metallicities is mostly a result of 
moving off the rotating component and into the underly- 
ing halo dominance, where the halo falls off much more 
slowly than the exponential disk. 

Beyond 40 kpc, the MDF contains 108 stars (Fig. 15, 
top right), but it is evident that the majority of these 
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M31 red giant candidates are more metal poor than the 
crossover region by ~ — 1 dex. In particular, the 60 kpc 
fields provide a transition from the metal poor outer- 
most fields and the more metal rich inner parts, which is 
to be expected as they are dominated by the more metal 
rich tangential streams they lie on (see Fig. 1, bottom 
panel; Chapman et al. 2008). All in all, there is a pro- 
nounced metallicity gradient seen throughout our fields 
within M31's spheroid, which becomes even more strik- 
ing in the plot of individual metallicities as a function of 
distance from M31 (bottom panel of Fig. 15). 

7.1. Comparison with previous detection of a gradient 

A distinct metallicity gradient in M31's halo has al- 
ready been proposed by Kalirai et al. (2006a), who find 
a more gradual leveling of their MDFs, based on pho- 
tometric metallicities, from —0.47 dex within 20 kpc to 
—0.94 dex around 30 kpc down to —1.26 dex beyond 
60 kpc (open squares in the top and bottom panels of 
Fig. 16). Thence their data imply a smooth decline as 
(-0.77±0.09) dex (100 kpc)- 1 . The analysis of our data, 
on the other hand, yields a steeper radial metallicity gra- 
dient of the order of (-1.50 ±0.08) dex (100 kpc)" 1 . Our 
finding of a clear abundance gradient is underscored by 
the disappearance of TiO strong, metal rich giants in 
the outermost fields (Fig. 15, top left panel). In fact, 
the outermost radii appear to be dominated by a purely 
metal poor population. Another significant difference be- 
tween the gradients derived in this work and suggested 
by Kalirai et al. (2006a) is the overall lower metallicity at 
almost any given radius in our analysis. On average, our 
mean abundances in those fields in common with their 
data are more metal poor by 0.75 dex. Given the quoted 
measurement errors from both sources, this discrepancy 
is significant at the 3.4 a level and the reason for such a 
deviation merits careful investigation. 

In the top panel of Fig. 16, we only include the minor 
axis data, which were radially binned such as to guar- 
antee the same, statistically significant number (~100) 
of stars per bin. Our data and those of Kalirai et al. 
(2006a) additionally includes off-axis fields, which we 
highlight with the encircled points in the top panel of 
Fig. 16. Two of these are located directly on the Gi- 
ant stream (H13s and a3), while also al3 and bl5 fall 
towards the edge of the stream feature at a projected 
distance of <~ 50 kpc. It is known from previous spectro- 
scopic measurements (Guhathakurta et al. 2006; Kali- 
rai et al. 2006b) that the stream is intrinsically more 
metal rich than the halo. One other field in Kalirai et 
al. (2006a) lies on the dwarf spheroidal (dSph) galaxy 
And III (d3), which exhibits a population representative 
of M31's moderately metal poor satellite galaxies (with 
a mean abundance of ~ —1.7 dex; McConnachie et al. 
2005). As the field may well be contaminated with dSph 
members, we do not consider it a representative choice 
for a study of the pure M31 halo. Due to our inclusion 
of the inner spheroid fields, we have significantly greater 
numbers within 20 kpc. In order to evaluate the influ- 
ence of the off- axis fields on the metallicity gradient, we 
include in the bottom panel of Fig. 16 all our measure- 
ments throughout M31's spheroid, separated by field (see 
also the color coded map in Fig. 1, bottom panel). These 
fields include H13s, a3, al3, al9 and bl5, which were also 
targeted by Kalirai et al. (2006a). Nevertheless, it is ob- 



vious that the strong character of the gradient persists 
disregarding whether we focus on the minor axis or on the 
whole spheroid. Thus the metallicity gradient is likely a 
characteristics associated with M31's full halo. 

One major source of uncertainty is in general the use 
of photometric metallicities. Larger photometric errors, 
the choice of the adopted set of isochrones, the general 
failure of stellar evolutionary tracks to simultaneously re- 
produce the major features of CMDs (Gallart et al. 2005; 
and references therein) and undesired age-metallicity de- 
generacies on the RGB render photometry the less reli- 
able metallicity indicator as compared to spectroscopic 
estimates. Isochrone fits produce values for the actual 
stellar metallicity, Z, which incorporates the admixture 
of heavy elements, in particular the a-elements. In this 
vein, any unknown a-enhancement can considerably al- 
ter the derived photometric metallicities. Kalirai et al. 
(2006a) demonstrate that an [a/Fe] ratio of +0.3, as 
found in the Milky Way halo, yields results more metal 
poor by 0.22 dex compared to those derived from scaled- 
solar isochrones (but see also Koch et al. 2006 for a dis- 
cussion of the systematics of a- variations on CaT metal- 
licities). In order to reconcile the spectroscopic (this 
work) and photometric (Kalirai et al. 2006a) metallici- 
ties a strong enhancement in these elements of at least 1 
dex would be required. This seems an unreasonably high 
value, even if the formation of the outer halo regions were 
dominated by early star formation bursts. 

Photometric metallicities are especially susceptible to 
overestimating the metallicities of metal poor stars. For 
stars with [Fe/H]< —1.5 dex, there is relatively little 
difference in the metal line blanketing in the V band: 
In color-magnitude diagrams such as Fig. 3, a 1 dex de- 
crease in metallicity causes little discernable change in 
color. Given that for many distant systems, the only ac- 
cessible metallicity estimate is photometric metallicity, 
we would thus urge the use of bluer filters, and greater 
caution in the interpretation of results. It is reassuring, 
though, that despite the large overall discrepancy be- 
tween the Kalirai et al. (2006a) and our measurements, 
there is a significant agreement between the two studies 
in field m8 (at 120 kpc). In this field, we find 7 giant can- 
didates, for 5 of which we could measure an [Fe/H]caT- 
If we discarded those two stars with the largest veloci- 
ties that are in the transition higher-velocity tail of our 
distribution (~l.2-l.5cr above the systemic mean), the 
mean abundance would even drop by another 0.1 dex, 
confirming the dominance of metal poor of stars in these 
outer fields. We note in passing that m8 constitutes one 
of the fields for which Ibata et al. (2007) exclude the 
occurrence of any substructure, so that it may in fact be 
a contender for representing M31's genuine underlying 
metal poor halo (see also Chapman et al. 2006, 2008). 

Finally, we add to Fig. 16 the mean spectroscopic 
[Fe/H] in a minor axis field at 19 kpc, based on the 29 
confirmed red giant members of Reitzel & Guhathakurta 
(2002; their Fig. 17). These authors not only find that 
their spectroscopic metallicities are systematically lower 
than the photometric counterparts, but they also de- 
tect a distinct metal poor tail in this MDF, reaching 
as low as [Fe/H]caT = —2.85 on the scale of Carretta & 
Gratton (1997). By comparing their MDF to those of 
Galactic and M31 globular clusters and of Local Group 
dSph galaxies, they attributed its shape and the pres- 
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ence of such metal poor stars to the build-up of M31's 
halo from the accretion of many a small subsystems. As 
this comparison with our data shows, the mean metallic- 
ity in the Reitzel & Guhathakurta (2002) field is in good 
agreement with our measurement in the respective radial 
bin. The most recently discovered faint dSph satellites 
around M31 are predominantly characterized by mean 
metallicities between —1.3 and —1.7 dex (Martin et al. 
2006b; Majewski et al. 2007), which agrees well with 
the field star population's abundance over a radial range 
from -20-50 kpc. 

7.2. Andromeda's metal poor outer halo 

In contrast, the significant discrepancy of the mean 
metallicity estimates in the outermost field, mil, is an 
issue of concern: While Kalirai et al. (2006a) state a 
value of — —0.92 dex from their three confirmed red gi- 
ant members, we find a value as low as — —2.6 dex from 
10 red giant candidates stars in total. It is then fair to ask 
how reliable are our low metallicities in this highly fore- 
ground contaminated field and how trustworthy is our 
detection of metal poor stars at all radii? First, if those 
marginal stars with potentially too high velocities were 
removed from mil, the mean value in this field would 
essentially remain unaltered - the metal poor nature and 
strong gradient do persist. All in all, there are 56 stars 
with [Fe/H]caT < -2.3 found at all radii from 9 to 160 
kpc. Based on their higher radial velocities, no more than 
6-8 of these could be potential remaining foreground con- 
taminants. Furthermore, it was verified by visual inspec- 
tion that none of the stars is a potential mismatch in 
spectral type or exhibits spurious noise peaks. Also, we 
note that the median log-likelihood L (Sect. 4) amounts 
to 0.7 for these stars so that it is 5 times more likely that 
they are M31 red giants than Milky Way stars based on 
the adopted discriminant indicators. As we have shown 
in Sect. 4.1, there may be ~35 undetectable blue dwarfs 
in our whole sample (Table 2), based on the Besangon 
predictions. Furthermore, the model predicts that only 
15% of these stars have nominal metallicities below —2.3 
dex, which translates into —5 such undetectable contam- 
inants in our sample. 

The principal caveat against deriving a metal poor tail 
in stellar populations is that, generally, no calibrations of 
the CaT strength exist below —2.1 dex. Neither the orig- 
inal sample of Rutledge et al. (1997a, 1997b) included 
any system more metal poor than —2.02 dex (on the scale 
of Carretta & Gratton 1997), nor did our calibration clus- 
ters (Fig. 7). Nevertheless, the CaT technique is widely 
applied throughout the literature of metal poor stellar 
populations and can still maintain its prime role in at 
least a relative ranking of stars towards the metal poor 
extrapolations (Koch et al. 2006; Simon & Geha 2007; 
Battaglia et al. 2008) 15 . We plot in Fig. 17 a series of 
spectra that were grouped and coadded in various metal- 
licity bins. It becomes obvious that the expected trend 
of increasing CaT line strength with increasing [Fe/H] is 
also visible in our data so that our metallicity scale and 

15 The same holds for the calibration of metal rich stars: 
the most metal rich globular cluster in our sample, M5, has 
[Fe/H]cG97 = —1-12 dex), while a small fraction of our measured 
M31 stars nominally reach above solar values. That the CaT cal- 
ibration is still valid up to +0.47 dex has recently been demon- 
strated by Carrera et al. (2007). 



ranking derived from the CaT coaddition did not intro- 
duce any grossly falsified results, in particular towards 
the metal poor spectra. This coaddition of the spectra 
also emphasizes a number of Fe I and Ti I lines that 
clearly scale with metallicity in our spectra and will al- 
low us to estimate chemical abundance ratios in future 
works (see also Kirby et al. 2008). 

As a further comparison, we generated synthetic spec- 
tra of red giants, using Kurucz model atmospheres 16 
with representative stellar parameters, i.e., (T e g=4000 
K, log g=1.0, £=1.5 kms -1 ), and solar-scaled opacity dis- 
tributions 17 (e.g., Koch et al. 2008). In Fig. 18, we show 
the resulting syntheses for different metallicities of the at- 
mospheres. These spectra have been degraded to match 
the spectral resolution of DEIMOS and, in the bottom 
panel, convolved with an additional noise component the 
mimic a representative spectrum with S/N=10. Apart 
from the expected weakening of the CaT, there is a visi- 
ble decrease in the strength of the weak Fe I absorption 
features (e.g., at AA8514.1, 8468.4, 8621.6, 8674.7A) ad- 
jacent to the CaT (see also Bosler et al. 2007) towards 
lower metallicities. This decrease is even discernible in 
the low S/N spectra and in fact observable in our coad- 
ded spectra (Fig. 17). 

One might then argue that the metal poor stars could 
be an artifact of the new coaddition technique and its 
calibration devised in Sect. 5. To test this, we grouped 
the spectra of progressively metal poor stars, and applied 
the same method as before on the coadded spectra. It 
transpires that the metal poor character of these spec- 
tra persist and there is a tight, close-to-unity relation 
between the metallicity bin of the individual stars to be 
grouped, and the final coadded estimate from the coad- 
ded spectra. Moreover, the use of our own consistent 
calibrations (eqs. 5,6) does not alter the qualitative de- 
tection of metal poor stars: if we were instead to use the 
canonical calibration of Rutledge et al. (1997a, 1997b), 
we would observe a shift of the measurements towards 
[Fe/H] lower by 0.2 dex at —3 - if anything, our cali- 
bration would overestimate the metallicities so that the 
metal poor character of the distribution remains. Fur- 
ther tests that we employed to ascertain the reality of 
the gradient and the metal poor stars verified that there 
is no unusual trend of the [Fe/H] with magnitude dis- 
cernible, nor does the gradient change when we restrict 
the analyses only to the high- or low- 5/ AT spectra, re- 
spectively. Moreover, to test the influence of potential 
remaining dwarf contamination, we constructed another 
test sample by only including stars with radial veloci- 
ties below —300 kms -1 . Apart from increased statistical 
uncertainties due to the decreased sample size, the outer 
regions do remain metal deficient and the gradient perse- 
veres. On average, the "pure", velocity restricted giant 
sample yields mean metallicities in each field that are 
more metal poor by 0.04 dex (r.m.s. scatter of 0.50 dex) 
on average than the full giant data. Essentially, the same 
holds if we inflict a strict color cut to select a "pure" gi- 
ant sample: a gradient persists even for a, say, V — i' > 1 
subset. Neither are any significant changes found if we 
restrict the analysis only to giants classified as such in 
> 95% of the Monte Carlo runs of the dwarf/giant sep- 

16 http : //kurucz . harvard . edu| 

17 http: //wwwuser . oat . ts . astro . it/castelli 
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aration (Sect. 4). The median difference between the 
mean metallicities from "all" giants and the 2a cases is 
< 0.01 dex (r.m.s. 0.08 dex). Moreover, our dwarf sam- 
ple does not exhibit any significant sign of a gradient (at 
0.06±0.04 dex (100 kpc)- 1 ), which is expected, since the 
CaT is no metallicity indicator for these stars so that 
their [Fe/H] arc randomly distributed. 

The color coded CMD in Fig. 3 then verifies that 
the trend of metallicity with location on the RGB is in 
fact as expected, with the more metal rich stars (red 
points) exhibiting progressively redder colors, and the 
most metal poor stars (blue points) being predominantly 
located towards blue colors. We also note the presence 
of ^40 giants that fall blucwards of the most metal poor 
isochrone in Fig. 3. Given our discussion in Sect. 4.1, it 
appears unlikely that these are Galactic contaminants. 
Although their spectroscopic metallicities indicate them 
to be metal poor objects, their remarkably blue colors are 
surprising. If one were to assign these colors to erroneous 
photometry, and thus redden these objects towards the 
[Fe/H]=— 2.3 isochrone, this color difference translates 
(Eqs. 5,6) into a spectroscopic metallicity uncertainty 
of less than 0.30 dex with a median of 0.03 dex (r.m.s. 
scatter 0.09 dex). All in all, we are left to believe that 
there is in fact a detectable, real population of consider- 
ably metal poor red giants present in M31's inner halo 
that becomes yet more prominent in its outer halo. 

This fact then confirms the view of an accretion origin 
of the inner halo, presumably by several events, and ar- 
gues strongly in favor of the same mechanism governing 
the formation of the inner and the outer halo. Moreover, 
there is additional evidence (Chapman et al. 2008, in 
prep.) that halo fields at 110 kpc are not affected by any 
of the substructures in the Ibata et al. (2007) maps and 
are consequently a plausible true halo component; they 
are found to have [Fc/H]^ —2. 

Given that the halo of M33 itself is also metal poor 
(at around —1.5 dex; McConnachic et al. 2006) it is 
again feasible that a major fraction of the red giants in 
the outermost fields, are members of M33, the more so, 
since their velocities appear to be similar to the systemic 
velocity of M33. At present, there is no compelling ev- 
idence against the hypothesis that all the candidate gi- 
ants in these distant fields > 120 kpc are genuine M33 
members. Fig. 19 (left panel) shows the metallicities and 
velocities of stars in the three most distant fields, where 
we schematically overplot each galaxy's velocity distribu- 
tion, using the stellar halo parameters from McConnachic 
et al. (2006) for M33 and those derived in this work for 
M31. Under the simplifying assumption that these fields 
contain an equal mix of M33 and M31 stars, we can es- 
timate, based on our targets' velocities, that 85% of the 
red giant members in field mil and m8, and 37% of those 
in m6, might actually belong to M33's halo. Statistical 
removal of this contribution would yield mean metallici- 
ties of (-1.94 ± 0.52, -2.00 ± 0.60, -1.60 ± 0.38) dex in 
m6, m8, and mil. However, there is no way of reliably 
separating the mutual overflow of giants into each other 
galaxy's halo at present. 

The more metal rich stars at [Fe/H]> —1.2 dex and ve- 
locities around —325 kms -1 , on the other hand, might 
plausibly be considered to have originated in a radial 
collision, presumably like the one that bore the ancient 
Giant Stream (see Mori & Rich 2008). This hypothesis is 



also consistent with them being close to the systemic ve- 
locity of M31. In particular, there are no stars found with 
very high negative velocities relative to M31 - all giants 
in these fields have velocities well within ~ 1.5<r of the 
M31 mean in the negative velocity tail (which does ac- 
count for M31's radially decreasing velocity dispersion). 

8. CONCLUSIONS 

From a spectroscopic analysis of 1316 confirmed red 
giant stars along the minor axis on M31 and in spheroid 
fields out to 160 kpc we find and confirm the following 
structural specific features in M31's inner and outer halo: 

1. There is evidence of an abundance substructure in 
the sense that the mean stellar metallicity strongly 
declines at ^20 kpc, where the abundance range 
within this radius is typically —0.5 to —1 dex and 
falls towards -1.4 dex at 20-40 kpc. The latter 
values are consistent with those detected within 
a smooth underlying M31 halo (Ostheimer 2003; 
Guhathakurta et al. 2006; Kalirai et al. 2006a; 
Chapman et al. 2006). Interestingly, the loca- 
tion of the break in the metallicity profile coincides 
with the edge of the metal rich, extended rotating 
disk reported by Ibata et al. (2005), but it cannot 
be entirely ruled out that there is also a contribu- 
tion from the kinematic substructure (Gilbert et al. 
2007), which would bias the inner regions towards 
marginally higher metallicities. A metallicity gra- 
dient has been detected by Kalirai et al. (2006a), 
but here we show that the decline in our spec- 
troscopic measurements appears to proceed even 
stronger towards the outermost fields at 160 kpc, 
where we find mean values around and even below 
-2 dex. 

2. In particular, there is a considerable fraction of 
metal poor stars below [Fe/H]caT ^ — 2 found at 
almost all radii. Their presence is not utterly sur- 
prising, and red giants that metal poor have been 
claimed to exist in M31's halo before (e.g., Reitzel 
& Guhathakurta 2002). Furthermore, their estab- 
lished prominence in the Milky Way halo (Carney 
et al. 1996; Chiba & Beers 2000; Carollo et al. 
2007) raises the question, why there should not be 
a comparable distribution present in the M31 halo, 
if both systems had experienced a similar formation 
and accretion history. 

3. A considerable fraction of stars in the outermost 
fields beyond ~100 kpc (which corresponds to 
about 100 kpc projected distance to M33) exhibit 
velocities and metallicities consistent with those of 
M33 halo stars. This confirms earlier findings from 
star count maps (Ibata ct al. 2007), according to 
which the stellar halos of these major Local Group 
spirals overlap to a large extent. 

4. We confirm the earlier detection of a kinematically 
cold substructure (Gilbert et al. 2007), located at a 
radial distance of 15-20 kpc. By comparison of new 
N-body simulations (Mori & Rich 2008), we show 
that such a substructure is consistent with hav- 
ing originated in the merger event that produced 
M31's Giant Stellar Stream (Ibata et al. 2001, 
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2004). Nevertheless, the full radial velocity distri- 
bution along the minor axis is difficult to reconcile 
with a single collisional event of this kind, and it 
is more likely that a wealth of accretions occurred 
and formed the halo, which is concordant with the 
progressive detections of stream-like substructures 
in star count maps (Ibata et al. 2007) . 

5. There is a considerable contribution of stars from a 
genuine, ancient M31 halo, potentially ejected dur- 
ing such merger events, to the velocity distribution 
at any radius. Moreover, neither stars from the 
pristine bulge or disk components are likely to be 
found in the outer or even inner spheroids. 

This leaves us with a picture in which the progenitor 
that produced the giant Stream and thus presumably 
donated a major part of M31's halo cannot be single- 
handedly responsible for all the substructures seen in our 
and previous studies - outside the present-day Stream's 
sphere of influence, one sees a predominant occurrence 
of minor substructures or streams from many a past 
mergers. A large number of accretion events is also 
thought to have contributed to the formation of M31's 
disk (e.g., Penarrubia et al. 2006), as this would produce 
the proper, observed mix of metallicities. Ibata et al. 
(2007) note that the radial metallicity gradient is a mere 
reflection of Stream debris and numerous other substruc- 
tures in the inner regions, which is in concordance with 
model predictions. In this context the metal rich, genuine 
Stream material is more centrally concentrated, leaving 
the impression of a more extended underlying metal poor 
halo (cf. the stream distribution in Fig. 13). It is unclear 
at present, whether the gradient is intrinsic to the inner 
substructures becoming more metal poor with radius, or 
whether the underlying halo, with its larger dominance 
in the outer regions, is becoming more metal poor. Note 
that Brown et al. (2007) find that an HST field at 25 
kpc (~ 1.5°) is both more metal poor and older than the 
innermost HST field projected at 11 kpc. Hence, also 
the HST pointings appear not to be dominated by the 
canonical Stream's debris. Moreover, we find that the in- 
nermost deep HST field (Brown et al. 2003) is situated in 
a region that has some potential contribution from tidal 
debris, but appears to be dominated by a more metal rich 
" inner halo" population with a smooth velocity distribu- 
tion, free of the structures at the velocity extremes that 
are predicted to be present from interaction models (see 
left middle panel of Fig. 13). The Brown et al. (2003) 
11 kpc field appears to be free of major contamination 
from an infall event. The 21 kpc field lies just outside of 
the metal rich inner region; Brown et al. (2007) suggest 
a lower metallicity for this field, which we confirm from 
our CaT analysis. The field at 35 kpc apears to be yet 
more metal poor and may be genuinely representative of 
the outer halo. We note that all the HST fields at 11, 
21 and 35 kpc have very different circular orbital periods 
(derived from a simple mass model and the assumption of 
circular orbits) of ^0.2, 0.5 and 1 Gyr, respectively. The 
characteristics of the HST deep fields will be discussed 
in a forthcoming paper (Rich et al. in prep). 

Using the same arguments, we find large orbital peri- 
ods of ~6 Gyr at 160 kpc, which presupposes them to be 
bound to M31 (see also Majewski et al. 2007). It is hard 
to reconcile these time scales with a scenario, in which 



these distant M31 members are pressure supported. It is 
rather an attractive notion that these distant stars might 
be ejecta of collisions, not yet having completed a single 
orbit around M31. 

The fact that the new metallicity scale we framed in 
this work is in good agreement with the moderately metal 
poor character of the dSph satellites of M31 (Reitzel & 
Guhathakurta 2002; McConnachie et al. 2005; Martin et 
al. 2006b; Majewski et al. 2007) further support the idea 
that the outer halo might plausibly emanate from a pop- 
ulation similar to the dSphs. Moreover, the presence of 
the more metal rich stars beyond 100 kpc suggest an ori- 
gin of these stars in collisions rather than in a primordial 
halo. Yet it is challenging to reconcile this scenario with 
the number count maps of Ibata et al. (2007), which are 
smooth to within the sensitivity of the MegaCam survey 
in these regions and with the presence of very metal poor 
stars: there are no very metal poor stars detected in the 
Galactic dSphs (Koch et al. 2006; Helmi et al. 2006), 
nor in the faintest M31 satellites (Martin et al. 2006b, 
2007), so that it remains unclear whether systems like 
the present-day observed dwarf satellites are responsible 
for the bulk of the halo. 

While our observed radial metallicity gradient 
stretches 1.5-2 dex over the full extent of our data of 160 
kpc, simulations of the hierarchical assembly of Galactic 
halos do not reproduce any considerable gradients. In 
this context, the study of Font et al. (2006) predicts gra- 
dients in [Fe/H] averaged over each full simulated halo of 
at most 0.5 dex over a few tens of kpc. 

Our findings lend striking support to paralleling the 
M31 and Galactic halos: In a recent work Carollo et al. 
(2007) detected a dichotomy in the Milky Way's stellar 
halo. In this sense, there exists a clear kinematic and 
chemical separation into an inner halo (with stars on 
more eccentric orbits, metallicities around —1.6 dex, a 
flattened density distribution and no considerable rota- 
tion) and an outer, independent halo component, which 
is more spherical in shape, shows evidence of rotation 
and is more metal poor on average. In particular, the 
outer Galactic halo exhibits a peak metallicity of —2.2 
dex. The presence of two distinct halos is well explicable 
in the context of cosmological structure formation mod- 
els and incorporates the continuous, chaotic accretion of 
distinct subhalos that follow an earlier stage of the dis- 
sipative merging of massive, yet sub-galactic fragments 
(see Carollo et al. 2007 for a discussion of the detailed 
formation scenario) . If structure formation proceeded in 
analogy to form M31 - and in the light of ACDM this 
is likely - then the clear distinctness of at least two halo 
components visible in the MDFs in our M31 study is a 
natural outcome of the accompanying stochastic abun- 
dance accretion. 

As a concluding, independent remark, we note that a 
further benefit of the present study is the derivation of a 
new improved method to measure metallicities from the 
calcium triplet, which allowed us, and should encourage 
future works, to extract reasonably accurate information 
from low S/N spectra. 
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APPENDIX 
BACKGROUND GALAXIES 

In the full sample of DEIMOS targets taken for the entire project, we identified a still large number of ^400 galaxies 
background galaxies, which corresponds to a striking contamination fraction of ~11%. It is possible that more galaxies 
were missed during the classification of the spectra and rather flagged as bad quality data, although we verified that no 
"true" dwarf or giant star was mistaken as a galaxy spectrum and vice versa. These galaxies cover the full color range 
from ~0.5 to 3.5 in V — i', with the majority falling in the interval between 1 and 2 (see the CMD in Fig. 3). Despite 
the thorough pre-selection of potential M31 giant candidates on the upper RGB and, for some of the inner fields, 
using Washington photometry (e.g., Gilbert et al. 2006), the partially poor seeing conditions during the photometric 
runs (obtained at the KPNO; sec Osthcimcr 2003; Gilbert et al. 2006) hampered an appropriate a priori rejection of 
non-stellar extragalactic point sources. 

For those galaxies identified here, redshifts were measured from the Doppler shifts of generally 2-3 major emission 
features, such as the Balmer Ha, H/3, H7 lines, the [O II] 3727A doublet and/or the [O III] 5007A line. For rare 
occasions (^5 of the galaxies) we could also detect the strong CaH,K features in absorption. The final redshift 
distribution is displayed in Fig. 20. The first thing to note is that our sample includes a fairly broad range in redshifts, 
reaching from z ~0.04 to z ~1.35. 

This distribution should, however, not be taken as representative of the true galaxy distribution in the line of sight 
towards M31. This is due to our identification criteria based on only a small number of spectral features, which are 
then redshifted into the limited spectral range of DEIMOS (see also Kirby et al. 2007), biassing the distribution 
towards only selected redshift intervals. Hence, a detailed derivation of their physical properties and star formation 
rates has to be beyond the scope of this work and is left for a future paper (Koch et al., in prep.). 
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TABLE 1 

DEIMOS MASK OBSERVATION DETAILS 



Mask 


Date of 


P.I. 


Projected 




Mask center 


PA. 


observation 


radius a [kpc] 




(J2000.0) 


[degree] 


Minor axis fields 


fl09_l 


2005 Aug 29 


Rich 


9 (194) 


00 45 47.06 


+40 56 52.3 


23.9 


Hll.l 


2004 Sep 20 


Rich 


12 (190) 


00 46 21.94 


+40 41 42.7 


21.0 


Hll_2 


2004 Sep 20 


Rich 


12 (190) 


00 46 20.57 


+40 41 41.9 


—21.0 


f 116_1 


2005 Aug 28 


Rich 


13 (189) 


00 46 54.63 


+40 41 33.3 


22.6 


f 115_1 


2005 Aug 28 


Rich 


14 (188) 


00 47 32.51 


+40 42 05.3 


—20.0 


f 123-1 


2005 Aug 28 


Rich 


17 (185) 


00 48 05.79 


+40 29 35.5 


—20.0 


H35-1 


2005 Aug 29 


Rich 


17 (186) 


00 46 25.22 


+40 11 45.1 


-27.0 


f 130.3 
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Guhathakurta 


19 (182) 


00 48 35.01 
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Off axis spheroid fields 
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a Numbers in parentheses are projected distances from M33. 
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TABLE 2 

Number of reliable measurements per mask 



Mask 


# Targets 


# Reliable 


# M3l giant 


# M3l giants 
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a Prior to the membership separation. 15 Based on the comparison with the Besangon model. See text for details. 
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Fig. 1. — Top panel: Location of the targeted inner fields (red rectangles) on a star count map of M31 (courtesy of M. Irwin). Shown 
as blue squares are the HST fields of Brown et al. (2003, 2006, 2007). The solid black lines delineate the region in which we detect a 
break in the abundance profile (see Figs. 14-16) - this transition occurs remarkably close to the edge of the perturbed disk component in 
the star count maps. Bottom panel: Representation of the full data set in standard coordinates. Numbers at the top denote projected 
distances from M31 in kpc, also indicated by dashed circles. The individual masks are color coded by their metallicity (see Sect. 7). The 
black diamond represents the stream field H13s (Kalirai et al. 2006b), black circles are the HST fields, and M31 and M33 are indicated as 
crosses. Dashed ellipses illustrate the approximate location of the tangential streams found in the Ibata et al. (2007) maps. 
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Fig. 2. — Sample spectra of foreground dwarfs (top panel) and M31 red giants (bottom panel) with low and higher S/N ratios. The 
low S/N data (S/N=6 and 8) represent the approximate limit for which velocities and CaT metallicities could be barely determined; the 
S/N for the higher quality spectra are ~120 (top) and ~20 (bottom). Indicated are the wavelength regions of the surface gravity sensitive 
sodium doublet and the CaT. 
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Fig. 3. — Color magnitude diagrams of target objects, separated into M31 giants, foreground dwarfs and background galaxy contamination. 
Giant candidates for which we could determine spectroscopic metallicities are color coded by their [Fe/H]c a x in the leftmost panel. The 
small numbers of metal poor giants that arc plotted to lie to the blue of the bluest isochrone were carefully vetted and pass as likely M31 
giants. If these stars are excluded from the sample, our conclusions are not affected. The solid lines are isochrone sets of Girardi et al. 
(2002) for an age of 12.7Gyr and metallicities [M/H] of (left to right) -2.3, -1.7, -1.4, -1.0, -0.5, and 0.0 
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Fig. 4. — Probability distributions (PDFs) for those parameters that we used as criteria for dwarf/giant separation, i.e., Na doublet 
EW, V — i' color and radial velocity. The dashed lines each indicate the empirical PDFs defined by the full training samples, while the 
solid histograms were separated by applying the respective criteria to our full data. Thin and thick lines discriminate between dwarf and 
giant samples, respectively. Despite a considerable overlap in each individual indicator, the statistical combination of all three pieces of 
information allows for an efficient dwarf removal. The bottom right panel shows the ratio of thus separated giant to dwarf stars, separately 
for each field. See text for details. 
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Fig. 5. — Left panel: Color-velocity distribution of foreground dwarfs from the Galactic Besangon model (Robin et al. 2003), at a 
projected distance of 160 kpc to M31. The solid lines illustrate the cuts we used to assess the number of undetectable blue dwarfs in our 
M31 giant sample. Also indicated are the observed photometric errors. The right panel shows velocity distributions of our observed stars 
(shaded and dashed histograms) with V — i' < 1 and the expected blue dwarfs from the Besangon model (solid histogram). The fraction 
of unresolvable blue dwarfs in our giant sample is well below 3% and negligible. 
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AX [A] 

Fig. 6. — Illustration of the CaT coaddition in a high S/N spectrum of a red giant in a calibration globular cluster (star N3201-S11; 
S/N^llO; V=14.5mag; v H c = 485 kms" 1 ; <T,W>= 3.8A) and a low S/N DEIMOS spectrum of a M31 red giant (star 5004266; S/JV~4; 
V=21.7mag; vhc = —338 kms -1 ; <SVF>= 2.4A). The top panels display each individual Ca line, shifted to a common line center, and 
the bottom panels show the weighted coadded line (solid line), from which we measure the CaT line strengths using a Penny (i.e., Gaussian 
plus Lorentz profile) fit (dashed line). 
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Fig. 7. — Left Panel: Linestrengths of red giants in the calibration clusters from FLAMES data (Koch et al. 2006) vs. magnitude above 
the horizontal branch. The right panel shows the metallicities of these globular clusters on the scale of Carretta & Gratton (1997), with 
the residuals of the best-fit relation (eq. 6) at the bottom. Solid and dashed lines indicate these best-fit calibrations (eqs. 5,6). All these 
measurements arc based on the coadded CaT. 




Fig. 8. — Distribution of the relative error on the measured coadded line strength <T,W>, determined from Monte Carlo simulations, as a 
function of spectral quality. For our observed spectra, these values are propagated through Eqs. 5,6 to obtain the spectroscopic metallicity 
uncertainty. 
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Fig. 9. — Spectroscopic versus photometric mctallicitics for dwarf (crosses) and giant (points) candidates. Shown each are mean and la 
scatter. The dwarf stars clearly deviate from unity (solid line). Data were binned by 0.25 dex except for the most metal poor and metal 
rich bin, where we chose to include 20 stars for better sampling. 




Fig. 10. — Snapshot of the simulation data at 1 Gyr in standard coordinates and colorcoded by the particle density (Mori & Rich 2008). 
Each subpanel separately displays a different component: the accreted satellite only (top left), disk and bulge (top right), spherical halo 
(bottom left), and disk, bulge plus satellite (bottom right). 
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Fig. 11. — Density distribution (with arbitrary scaling) of radial velocities along the minor axis. The top panel displays our observed 
minor axis data within 35 kpc, whereas the bottom panel shows the distribution of the satellite particles from our simulation. Numbers at 
the top indicate distances in kpc to guide the eye. 
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Fig. 12. — Top panel: Radial velocities as a function of radial distance along the minor axis. Shown left is the the complete spectroscopic 
sample, without the removal of foreground dwarfs, within the inner 40 kpc, while the data shown in the right panel have been cleared of 
the foreground contamination using the methods described in Sect. 4. Indicated as grey shaded bars are the HST fields of Brown et al. 
(2003; 2006; 2007). The black contours delineate the distribution from our simulations (see Fig. 11, bottom panel). Numbers at the top 
indicate distances in kpc to guide the eye. The bottom left panel shows the radial variation of mean radial velocity for minor-axis (filled 
circles) and off-axis (open squares) fields. Small symbols at vhc ~ — 50 kms -1 are for the dwarf stars, while the larger symbols below 
— 180 kms^ 1 show the giant distributions. Comparison with the dotted lines at the systemic velocities of M31 (—300 kms -1 ) and M33 
(—180 kms -1 ) suggests that the giant samples in the outermost two fields appear to be suffering from contamination by M33 members 
(see also Fig. 19). The bottom right panel then shows the distribution of velocites for the full, dwarf-cleaned sample. The stream fields 
(a3, H13s) reflect in the peaks below —400 kms -1 and the peak at —520 kms -1 , both of which are reproduced by the simple orbit model 
of Ibata et al. (2004). 
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Fig. 13. — Velocity histograms along different radial bins (with the numbers representing their mean locations). The solid line represents 
our observed, dwarf-cleaned velocity sample. Shown as a shaded histogram is the velocity distribution of the stream particles drawn from 
our simulations at the same radial location as the observed fields, while the dashed thin line indicates the simulated contribution of M31 
halo particles to the velocity structures at these distances. Each distribution has been normalized to unity. 
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Fig. 14. — Left panels: Spectroscopic metallicity (on the scale of Carretta & Gratton 1997) versus radial velocity. Right panels: the same 
mctallicitics within 40 kpc as a function of radial distance. The dwarf contamination clearly stands out as a clump around —2 dex above 
> —150 kms -1 . The top (bottom) panels each display the data set before (after) removal of this contamination. Shaded regions indicate 
the HST fields of Brown ct al. (2003, 2006, 2007). For distributions of all metallicities out to 160 kpc see Fig. 15 (bottom panel). A solid 
line in the right panel has been added at [Fe/H]caT = —1 for reference. 
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Fig. 15. — A strong abundance gradient is qualitatively already visible in the top left panel, which shows the strength of the TiO band at 
710CjA in our entire giant sample (black points) versus radial distance. Note that dwarfs (red crosses) cover the full range in TiO strengths 
at all radii. The radial metallicity gradient is then clearly present in the spectroscopic MDFs in 3 different radial bins (Top right panel; 
solid lines) and for our dwarf-cleaned CaT sample (bottom panel). However, none of the distributions fully resembles the stream component 
(shown as shaded histogram). Notice that there are no metal rich stars beyond ~50 kpc. 
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Fig. 16. — Run of mean metallicity and metallicity dispersion as a function of radius. While both our data and those of Kalirai et al. 
(2006a) show the presence of a gradient, our measurements arc more metal poor on average. The top panel only includes our minor axis 
fields with a radial binning to obtain the same number of stars per bin. The points encircled and labelled by their identifier are not located 
on the minor axis. Horizontal errorbars indicate the extent of our radial binning. The open star indicates the spectroscopic mean [Fc/H] 
measurement of Reitzel & Guhathakurta (2002). In the bottom panel, we also include the off-axis fields and compute the mean mctallicitics 
separately for each field. 
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Fig. 17. — Coadded spectra, grouped according to our derived CaT metallicity. Apart from the prominent CaT lines at 8498, 8542, 
8662A(right panel), there are a number of weaker lines (mostly Fe I), which become progressively stronger in the more metal rich spectra 
(the CaT line strength < T,W > increases from 2.4A to 7.8A for the metallicity range covered in this figure). The coaddition also emphasizes 
a few a-element lines that become weaker for the more metal poor stars: note for instance the strong Ti I features at 8378, 8426, 8435A. 
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Fig. 18. — Synthetic spectra for a typical red giant, using Kurucz model atmospheres of different metallicities. The spectral resolution 
was reduced to match that of DEIMOS. Additionaly, a noise component was added to the bottom panel to illustrate a S/N ratio of 10. 
Even at these low S/N, visual ranking by the strength of a number iron lines (indicated by arrows) is possible and observable in our spectra 
(see Fig. 17). 
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Fig. 19. — Left panel: Metallicity and velocities for giant candidates in the outermost regions >40 kpc (small black dots). Different 
symbols highlight the distributions in the three outer fields. Also indicated are the velocity number distributions of the M31 halo (at 
a mean of —300 kms" 1 ) and that of M33 (around —180 kms -1 ) with arbitrary scaling. The right panel shows the respective velocity 
histograms of these outer stars. 
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Fig. 20. — Redshift distribution of background galaxies. Their respective color distribution is shown in Fig. 3. 



